Yelong Shen

Reinforcement World Model Learning for LLM-based Agents

Feb 05, 2026
Via arXiv

Test-time Recursive Thinking: Self-Improvement without External Feedback

Feb 03, 2026
Via arXiv

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Feb 02, 2026
Via arXiv

RLBR: Reinforcement Learning with Biasing Rewards for Contextual Speech Large Language Models

Jan 19, 2026
Via arXiv

Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization

Sep 30, 2025
Via arXiv

SAS: Simulated Attention Score

Jul 10, 2025
Via arXiv

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Jul 09, 2025
Via arXiv

PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning

Jun 17, 2025
Via arXiv

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Jun 10, 2025
Via arXiv

SoK: Are Watermarks in LLMs Ready for Deployment?

Jun 05, 2025
Via arXiv